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Abstract 

Linear-assignment Gel'fand-Pinsker coding (LA-GPC) is a coding technique for channels with interference 
known only at the transmitter, where the known interference is treated as side-information (SI). As a special 
case of LA-GPC, dirty paper coding has been shown to be able to achieve the optimal interference-free rate for 
interference channels with perfect channel state information at the transmitter (CSIT). In the cases where only the 
channel distribution information at the transmitter (CDIT) is available, LA-GPC also has good (sometimes optimal) 
performance in a variety of fast and slow fading SI channels. In this paper, we design the filters in nested-lattice 
based coding to make it achieve the same rate performance as LA-GPC in multiple-input multiple-output (MIMO) 
channels. Compared with the random Gaussian codebooks used in previous works, our resultant coding schemes 
have an algebraic structure and can be implemented in practical systems. A simulation in a slow-fading channel is 
also provided, and near interference-free error performance is obtained. The proposed coding schemes can serve as 
the fundamental building blocks to achieve the promised rate performance of MIMO Gaussian broadcast channels 
with CDIT or perfect CSIT. 

Keywords: MMSE filter, lattice coding, dirty paper coding 

I. Introduction 

Gel'fand and Pinsker [1] first considered the issue of communication with interference noncausally 
available at the transmitter but not available at the receiver. Recently, many renewed interests arose in the 
applications of a subclass of this problem called the linear-assignment Gel'fand-Pinsker coding (LA-GPC), 
where a linear strategy is used [2]. Costa [2] [3] first applied the LA-GPC in additive Gaussian noise 
channels, and revealed a surprising result that by treating the Gaussian interference as the side information 
(SI), the interference-free rate is achievable even when the SI is known only at the transmitter. Costa named 
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this special case of the LA-GPC as dirty paper coding (DPC). The DPC result is based on the assumption 
that perfect channel state information at the transmitter (CSIT) is available. That is, the fading coefficients 
of the wireless channel are perfectly known not only to the receiver but also to the transmitter. However, 
it is hard to have perfect CSIT in the wireless setting. Typically, the channel coefficients are estimated 
at the receiver and fed back with limited feedback channel bandwidth to the transmitter. In practice, we 
can assume that only the channel distribution information at transmitter (CDIT) is known and adopt the 
general LA-GPC. For scalar slow fading SI channels, LA-GPC was shown to have the interference-free 
outage performance [4]. For fast fading channels, the LA-GPC also has good (sometimes near optimal) 
rate performance in scalar and multiple-input multiple-output (MIMO) settings [4] [5]. 

One of the most important applications of the LA-GPC is the MIMO Gaussian broadcast channel (GBC). 
A MIMO GBC system consists of one transmitter sending information to many receivers, all equipped 
with multiple antennas. With perfect CSIT, the capacity region of MIMO GBC was shown to coincide 
with the achievable rate region when MIMO DPC is utilized [6]-[10]. The key to this capacity-achieving 
performance is that MIMO DPC can efficiently use the information of the multi-user interference, known 
at the transmitter, to make the receiver decode messages with a rate as if the undesirable interference 
does not exist. With only CDIT, the MIMO DPC does not perform well [11]. Using the general MIMO 
LA-GPC has been shown to have significant rate gains over applying the MIMO DPC naively and other 
beamforming-based strategies in the ergodic fast fading MIMO GBC [11] [5]. In the scalar slow fading 
GBC, using LA-GPC also provides a significant gain over the time-division scheme [4]. In contrast to 
DPC for which structured codebook designs are well known [8]-[10], [12], all current promising results 
of the LA-GPC [4], [5], [11] are based on unstructured random Gaussian codebooks. Lack of structured 
codebooks so far hinders practical applications of the LA-GPC. 

In this paper, we show that with judiciously designed spatial filters, good nested-lattice coding can 
achieve all achievable rates of the MIMO LA-GPC. Unlike codebooks used in the previous works [l]-[5], 
[11], lattice codebook has an algebraic structure and is possible to be implemented in practice. We rewrite 
the LA-GPC rate function in a non-trivial equivalent form. This new form motivates the subtle selections 
of the transmitter SI filter and the receiver filter to achieve the LA-GPC rate with lattice codes. We also 
provide a simulation for slow-fading channels, and near optimal interference-free error performance is 



obtained. Our coding can be directly applied to fading MIMO GBC with CDIT to obtain the rate gains 
derived in [4] [5] [11]. As a by product, we also propose a new structured MIMO DPC which achieves 
the optimal interference-free rate when perfect CSIT is available. Thus while being applied to the MIMO 
GBC systems with perfect CSIT [7], our MIMO DPC is superior to other existing sub-optimal works 
[13]-[15]. In summary, the main contributions of this work are 

1) We provide the methodology to construct the SI and receiver filters in the nested-lattice coding 
to make it achieve the LA-GPC rate. This rate was achievable previously only with unstructured 
random Gaussian codebooks [l]-[5], [11]. Although in [l]-[5], [11] LA-GPC and DPC seemed 
only different in their "linear-assignment matrix" selections in the strategy function, this difference 
will in fact change the entire random codebook design and the decoding rule [11]. In other words, 
naively using DPC (designed with perfect CSIT) and dealing with the fading statistics separately 
for the fading SI channels with CDIT is not a good approach, and will result in a rate strictly lower 
than the one achieved by the LA-GPC. An example of this rate loss can be found in [11, Section 
rV.B], and more discussions will be given in Section UlI] and |VIl In this work, we show that with 
lattice coding, the receiver filter must be different from the transmitter SI filter for fading channels. 
Such result contrasts with the common practice in the lattice-based DPC [8], and also verifies the 
above observations. Our derivations are new even in the scalar case, and our numerical examples 
validate this result. These numerical examples are the first real implementations having near optimal 
performance with finite codelengths in SI channels. All prior simulation results [8]-[10], [12] with 
such performances needed very long codelengths. 

2) Our transmitter is subject to a covariance matrix constraint, which is more general than the conven- 
tional power constraint over all antennas. An additional transmitter filter is introduced to deal with 
this new constraint. However, it also incurs new difficulties in the proof of our main result (Theorem 
[T]). The details and comparisons with [16] can be found in the proof. According to our Lemma [2l 
this filter will make the covariance matrix of the transmitted signal exactly as desired. This result 
extends the application of the proposed coding to MIMO GBC with general input covariance matrix 
constraints which subsume the per transmit antenna power constraints [7]. 

3) As a special case, a new MIMO DPC is also proposed. Since only the filters are adjusted in our 
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design for different fading conditions, our construction is the first unified design using structured 
codebooks for MIMO SI channels with perfect CSIT or only CDIT. Currently all other existing 
MIMO DPC designs, for example, the superposition coding vector DPC [10] or combining scalar 
DPC with vector channel diagonalization [8], [12], need full CSIT. With only CDIT, these designs 
all have difficulties to achieve the LA-GPC rate. Our MIMO DPC is also a non-trivial MIMO 
extension of the scalar one [8], and the detailed comparisons can be found in Section N\ 
The paper organization is as follows. We define the system model and provide backgrounds on lattice 
coding in Section |Ill Section UlI] shows our new form of the LA-GPC rate in Lemma \\\ Our main 
contribution is presented as Theorem [T] in Section |IV| Before that, our transmitter filter selection is shown 
in Lemma [2l while the SI and receiver filter selections are shown in Lemma [3l The detailed comparisons 
with [8] and the applications of the proposed scheme to MIMO GBC are provided in Section |Vl Section 
|VI] provides some numerical simulation examples. Finally, Section [Vlll concludes this paper. 

II. System model and preliminaries 
A. MIMO fading channel with side information at the transmitter 
We focus on the following MIMO channel ^ 

yt=Ht{iLt + ^t)+it. (1) 

where t is the time index and I <t <T ,T h the number of symbols in the code block; G ^ is the t\h 
received symbol, X/ G R^^^ is the rth transmitted vector symbol, Sf G R^^^ is the t\h vector interference 
signal known at the transmitter as the SI, M and are the number of transmitting and receiving antennas 
respectively; Ht G R^^^ is the random MIMO channel matrix encountered by the transmitted signal to 
the receiver at time L z ? G R^^ ^ is the additive Gaussian noise at the receiver where Z/ ~ A'r(0, ■ The 
channel input is limited by a given input covariance matrix constraint ^E/, which is positive semi-definite. 
This real model can be easily modified to encompass the complex signal model, as shown in Section |Vl 

*In this paper, entropy and mutual information are denoted by /!(•) and /(;), respectively. Deterministic and random matrices are denoted 
in bold-face and italic capitals, respectively. For matrix G, Tr(G) and Rk(G) denote the trace and rank; G^ and G^ denote the transpose and 
conjugate transpose, respectively. G7' and |Gj| are the inverse and determinant of a square matrix Gj. And I„ denotes the identity matrix 
of dimension n. The partial ordering between symmetric matrices are denoted by >- and for example, Gi >z_ G2 means (Gi — G2) is a 
positive semi-definite matrix. And for a bounded Jordan-measurable region R c E,"' , \\R\\ denotes the volume of R. 



5 

With only CDIT, there are two kinds of fading channels considered, the slow and fast fading channels. 
In the slow fading channels [4], Ht is random but fixed within the codeword length T; while in the fast 
fading channels [4], [5], [11], Ht,t = l...r, is assumed to be an i.i.d. random process with respect to 
time. In both cases, Ht can be obtained perfectly at the receiver but only the distribution information is 
known at the transmitter. We limit the distribution of to be Gaussian in the channels with only CDIT. 
For the channels with perfect CSIT [2], [8], 1 ... T, are constant within the codeword length T and 

known perfectly at the transmitter. And can be arbitrarily distributed. 

We can rewrite ([T]) in an equivalent super channel to present our coding scheme more easily in Section 
HVl By concatenating all T symbols, ([U) becomes 

y = //(x + s)+z, (2) 

where x = (x|, . . . , x^)^; the noncausally known transmitter SI s and the noise term z are obtained similarly 
from S; and respectively as x from x^. The covariance matrix of z is ^lA^r- The dimension of the real 
block-diagonal channel matrix H is NT x MT , with its tih. diagonal term as Hf. We also form the channel 
input covariance matrix constraint as 

EG = ^Ir®S/, (3) 

where ® denotes the Kronecker product. It means that the same constraint applies to all vector symbols 
within a codeword. The input covariance constraint is Eg >i '^x, where is the covariance matrix of x. 

B. Review of lattices and lattice quantization noise 

An m/^-dimensional real lattice A is defined as A = {Gb : b G Z"'^}, where G is the mi x generator 
matrix of A. We assume that G is full rank as in [16], and the lattice is nondegenerate [17]. Let Q. be any 
fundamental region [18] of A, the lattice quantizer associated with ^ with quantizer input g is defined as 
= ^5 if g G + The modulo-A operation associated with Q is then 

gmodnA = g-2n(g)- (4) 

Let u be a dither uniformly distributed in Q. and independent of g. It is proved in [18, Lemma 1] that the 
dithered quantization error (g + u) modfj A is also uniformly distributed in Q as u, and independent of 
g. The autocorrelation matrix of this error is Ef^ = E{u(u) }. Since the lattice is nondegenerate, Ef^ is 



positive-definite and nonsingular. One important fundamental region of A is the Voronoi region 'V , which 
is the set of points g G R™^ that are closest to in Euclidean distance than to other lattice points Xe A. 
The second moment [18] associated with this region is denoted as P{i^)- 

III. MIMO LINEAR- ASSIGNMENT Gel'FAND-PINSKER CODING AND ITS ACHIEVABLE RATE 

In this section, we will introduce the MIMO LA-GPC and its achievable rate, denoted as Rla, using 
random Gaussian codebooks. The new formula of R^a in Lemma [T] of subsection IIII-AI will play an 
important role in building surprising connections between Ru^ and the achievable rate of the proposed 
coding in Section |IVl To illustrate the MIMO LA-GPC, we presents the following channel as in Fig. [T] 
(a), which represents ([B in the Shannon random-coding setting as Q. 

y^ = H(X^ + 5^)+Z^ (5) 

where ^ is an N x M random matrix. For simplicity, we first consider the full CSIT case. Without loss 
of generality, we can replace H_ with deterministic H as in Fig. \T\ (b). Using binning technique on the 
random codebook [1], the rate 

/(C/^;y^)-/(C/^;5^) (6) 

is achievable for any particular choice of p{u\s) and /(■), where is an auxiliary random vector with 
distribution specified by the conditional distribution p{u\s), and /(■) is a deterministic strategy function 
such that = f{U^,S^). The LA-GPC uses the following "linear-assignment" strategy with random 
Gaussian codebooks as 

[/^ = WBS^-fX^ (7) 

where ~ N^{Q, {Hi) is independent of S^, and Wb is an M X M matrix. Note that this strategy 
specifies the function = f{U^,S^) as X^ = U^- WrS^. Costa showed that if Wr can be selected 
according to the full CSIT H, then the optimal interference-free rate is achievable [2], [3]. He then named 
this special LA-GPC as DPC. 

We now consider channels with only CDIT as in (|5]). The ergodic fast fading case is first considered, 
where the channel random process is assumed to be i.i.d. for every time slot [4], [5], [11]. The optimal 

^To emphasize the differences between the lattice codebook setting and the unstructured Shannon random codebook setting, signal vectors 
in the former are denoted in bold-face lower-cases while those in the latter are denoted in italic capitals with the superscripts specifying 
their dimensions. 
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Strategy for this channel is still an open problem due to lack of full CSIT. Thus [4], [5], [11] focused on 
the the achievable rate 



with the "linear-assignment" selection Q- The maximum of ([8]) over all linear assignment matrix Wb 
calculated with only CDIT is called the "linear-assignment" capacity. Although only the selection of 
matrix Wb is different compared with the DPC, this change will change the random binning codebook 
design. Moreover, the decoder will also be different. The decoder in [2], [3] seeks a codeword that is 
jointly typical with , while the LA-GPC decoder in ergodic fast fading channel seeks a codeword that 
is jointly typical with both Y^ and H_ [11, Sec. IV-B]. In both scalar and MIMO fast fading SI channels, 
the linear-assignment capacity is close to optimal in some signal-to-noise ratio (SNR) regions [4], [5]. 

For the quasi-static slow fading channel [4], the decoding error probability cannot be arbitrarily small 
since the transmitter does not know the reliable transmission rate with only CDIT. In this channel, the 
outage probability [19] for a given transmission rate R is a better metric than the Shannon capacity to 
measure the performance. Define Rla(R) = iiU^;Y^\H = n) -I{U^;S^), this probability is 



where H is a realization of H_. In [4], it is shown that LA-GPC achieves the interference-free outage 
performance in the scalar channel with properly selected Wb according to the CDIT. 

A. Achievable rate of the MIMO LA-GPC with random Gaussian codebooks 

We now explicitly compute Rj^, the LA-GPC achievable rate, using random Gaussian codebooks. The 
linear assignment matrix Wb is assumed to be determined in advance according to the CDIT as in [4], 
[5]. Since both the achievements of the linear-assignment capacity ([8]) in ergodic fast fading channels [4], 
[5], [11] and the outage probability ^ in slow fading channel [4] are based on the coding achieving 
Ru^ in a certain channel realization H_ = H [19], we will first focus on this case. Note that this H is 
only partially known at the transmitter. Also to simplify the presentations in the following sections, as in 
Section [III we concatenate T random vector Y^ in (|5]) as 



I{U";Y^,H)-I{U^;S''), 



(8) 



F{H:Rla{H)<R}, 



(9) 



Y 



■NT 




rNT 



(10) 



where the channel realization H corresponds to H in Q, X ~ A^]a(0,E(j) (Xg is defined in ([3])), 5 
A^]r(0, and Z'^^ ~ A^]a(0, ^Ia^j-), respectively. The covariance matrix ^E^ is block-diagonal. 
To show the LA-GPC rate, we rewrite channel (flOl) as 



= H(V2E^)1'"^ + H5^^+Z^^ = Hl'"^ + H5^^ + Z^^, (11) 

where H = Hv^E^ and E^ is an MT x Rk(EG) matrix which satisfies 

E^(E^)T = Eg. (12) 

The niT x 1 random vector X'"^ is distributed as A^r(0, ^I^r) and independent of S^^ and Z^^, where 
niT = Rk(EG). Note that ~ A^]r(0,Eg) is distributed the same as V2E^X'"^. 
We focus on the achievement of the following LA-GPC rate as 

i?^ = {/(f/'"^;y^^)-/(f7'"^;5^^)}/r, with f7'"^ = W5^^+l'"^. (13) 

Here W is an niT x MT block-diagonal matrix satisfying v^E^W = (g> Wb- The matrix Wb is computed 
in advance according to the CDIT as in [5] [4]. Note that E^ is also block-diagonal. Comparing f/'"^ 
with U'^ in (|7]), we have \t®U^ = v^E^f/'"^, where Ir is a r X 1 vector with all elements equal to 1 . 
We have 

Lemma 1: Let Y^Eu be the covariance matrix of the linear minimum mean-square error (LMMSE) 
estimation error £'™^yi^5£ to estimate 0""^ in from Y^^ in ([TI]) with LMMSE estimation filter 
Wu,MMSE, then the LA-GPC achievable rate using random Gaussian codebooks is 

^LA = Z7pl0g^^. (14) 

Proof: From ([H), 

Rla = {h{U"'^\S^^)-h{U'"^\Y^^))/T. (15) 



Due to the linear assignment strategy in (1131) . the first term becomes 

h{U"'^\S^^) = h{X"'^\S^^) = h{X"'^), (16) 

where the second equality comes from the independence between and X"^^ . As for the second term, 
we use the concept of the backward channel in the LMMSE estimation [20] to express U'"^ as 

(17) 
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Then 

where the last equality comes from the fact that the LMMSE estimation error E^^j^j^^^ is independent of 
[20]. Using ([Ml) and ([H in ([Bj) and recall that 1"'^ ~ N^{Q, we have d). □ 

IV. Nested Lattice coding with spatial filtering 

In this section, we will show that combining the proposed spatial filters and "good" nested-lattice 
coding, Ru, in Lemma [T] is still achievable under the transmission input covariance matrix constraint Eg 
without using random Gaussian codebooks. As in Section IIII-A[ without loss of generality, we focus on 
the fading channel ^ with a certain realization // = H as 

y = H(x + s)+z, (19) 

where the SI at the transmitter s ~ A/^ir(0, ^E^). We assume that Eg and the linear-assignment matrix W 
in ([T3] ) are given by [5] [4] according to the available CDIT. We define the nested lattice codes as 

Definition 1: Let A^- be a lattice and A^ be a sublattice of it, that is, A^ C A^. The codeword set of the 
nested lattice code is Cc = {Ac mod A^} = {Ac H i/"^}, where 'Uq is the Voronoi region of A^. 

We choose the code rate of nested lattice code as i? = ^^logH'^'^ll/lli^cll, where 'P'c is the Voronoi region 
of Ac. The dimensions of lattices are mT, where mT is defined right after ([T2|) . Our encoding/decoding 
scheme is as follows. 

Transmitter: The transmitter selects a codeword Cc E Cc according to the message index and sends 

x = Ft((cc-FsS-u)modA^), (20) 

where the dither signal u, uniformly distributed in 'Vq and independent of the channel, is known to both 
the transmitter and receiver. The subscript 'Vq for the modulo is omitted for brevity, that is, g mod A^ = 
^ — Qvqi'^-, Vg G R™-^. The transmitter filter Ft and the SI filter Fg will be determined later in Lemma |2] 
and [3l respectively. 
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Decoder: After passing x through the channel in (fT9l) . the decoder performs signal processing on the 
received signal and gets 

y = L(Fry + u), (21) 

where the receiver filters Fr and L will be determined in Lemma [3] and Theorem [H respectively. We use 
the generalized minimum Euclidean distance lattice decoder [16] to decode Cc- First the decoder finds 

b = argmin|y-LG,.b|2, (22) 

where Gc is the generator matrix of the channel coding lattice A^. And the decoded codeword is Cc = 
[Gcb] modA^. 

We will show the selection of filters Ft, Fs, Fr and L in the following lemmas. First, let the auto- 
correlation matrix of the dithered quantization error of A^ be . Since the lattice Ag is nondegenerate, 
>- 0. We can apply the Choletsky factorization [21] to obtain the matrix E^ satisfying 

e;(e;)T = e^. (23) 

The matrix E^ is lower triangular and nonsingular. And we have 

Lemma 2: Let Ft = E^(E;)-^ where E^ and E^ are defined in (fT2)) and (|23]) . then the transmitter 
covariance matrix E^ satisfies the covariance constraint Eg h E^ since = Zq. 



Proof: First note that from dH) and (|20l) . the transmitted signal x can also be expressed as 

X = Ft (Cq + Cc - FsS - u) = Ftx, (24) 

where Cq = Qa^ ( -Cc + FsS + u) and 

X = Cq + Cc-U-FsS. (25) 

Indeed, x is the lattice quantization error, which is independent of the interference s and distributed as u 
according to Section IH-BI Then x is distributed as FtU. It is zero mean due to the fact that I'q = —i^q, 
thus its autocorrelation matrix equals to its covariance matrix = FtE^Ft^. With our selection of Ft, 
= Eg due to (fT2)) and (|23|) . and the constraint is satisfied. Note that according to ([3]), Eg ^ since 
E/ >: 0. Thus, the full column rank matrix E^ satisfying (fT2l) always exists [21]. □ 
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Now we provide the selections of the SI and receiver filters Fs and Fr to make connection between 
the lattice coding achievable rate and Ru^ in Lemma [H These filters are selected according to the linear 
assignment matrix W in (fT3l) and the LMMSE filter Wu,mmse to estimate the auxiliary random vector 
f/™^ in Lemma [T] as 

Lemma 3: Let the filter Fr be V^E^Wu,mmse, Fs be v^E^W, respectively, where is defined in 
(l23l) . Then y in (|2T]) equals to L(c'(. + e), where G A^ + Cc G Ac and 

e ^ (FrHp - W)u + (FrH - Fs)s + FrZ. (26) 

Here Hp — HFt with Ft specified in Lemma [2l Moreover, 

2^ log ^=i?LA, (27) 

where E^ is the covariance matrix of e and E^ is defined in (|23l) . 

Proof: The proof of y = L(Cc + e) is shown in Appendix lAl As for (l27l) . we first let Fr = v^E^Wr 
where is an mT x NT matrix. It will be shown that the optimal Wr maximizing the left-hand- side 
(L.H.S.) of (l27l) is Wu,MMSE- First we show that 

I^eI = 2|E^||E(//|, (28) 

where E(// is the covariance matrix of 

The Gaussian vectors Y^^ and [7'"^ are defined in (fTTl) and (fT3] ). respectively. To see this, from the 
definitions of Y^^ and f/™^, equals to 

(WrH - I„r)^"^ + (WrH - W)5^^ + WrZ^^, (29) 

where H is defined right after (fTTl) . We observe that both E^'^ and e are zero mean. The dither u 
is uniformly distributed in the Voronoi region "Vq of A^, thus the covariance matrix of u is E^. Since 
X"^T r^N{0, ^Imr) by definition, the covariance matrix of u is equal to the covariance matrix of \/21L*^X"^^ . 
By definition, s and z are of the same distributions as 5"'^ and Z^'^ , respectively. Also u, s and z are 
independent. Using these facts, and comparing the chosen Fr,HF, and Fs in (|26l ) with Wr,H and W in 
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(|29l ). it is easy to check that equals to the covariance matrix of \/TL*E'^'^. And (|28] ) is valid due to 



Now we have \1:^\/\Le\ = \jlmT\/\^u'\ due to (EH]). Since E^^ = WrF^^ -f/"'^ is the estimation error 
of f/"'^ from via the linear transform Wr, choosing Wr = ^v.mmse ^^^^ minimize \^u'\ according 
to [22, P.2390]. Thus, choosing Wr = Wu,mm5£' the L.H.S. of (1271 ) will be maximized, and Zyi equals 



Finally, combining the previously specified filters with a "good" nested lattice, the optimality of resultant 
encoding/decoding scheme is given by the following Theorem. The detailed definition of the "good" nested 
lattice is omitted, and can be found in [16], [18]. 

Theorem 1: Let filters Ft, Fs and Fr be selected as in Lemma [2] and Lemma (3] respectively, and the 
second moment P{i^q) of be 1/2. If L in is chosen as L = E^(E|)"^ in which = E^ 

and E^ is defined in (|23]) . based on sequences of "good" nested lattices, the coding scheme specified in 
(120])-(I22]) is able to achieve the LA-GPC rate Rla when T ^ oo. 

Proof: First note that Ze^ is the covariance matrix of the LMMSE error E'^^j^^^^ as in Lemma [T] 
thus it is always invertible [20]. From the Proof of Lemma [3] E^ = then L always exists since 

E^ is also invertible. Basically, we will prove that if 



the specified filters will make the lattice decoding error approach zero as T oo. The final equality of 
(|30]) was proved in Lemma [3] This proof is a non-trivial extension of [16, Thereom 5], where channels 
without transmitter SI was considered. Compared with that proof, we propose new filter selection methods 
tailored for MIMO SI channels with only CDIT as shown in Lemma [Tl [2] and [3] Moreover, in [16], the 
transmitter is subject to a conventional average power constraint. In our case, the filter Ft designed for 
a more stringent transmitter covariance matrix constraint Eg will make the proof to upper-bounding the 
decoding error probability more involved. The shaping of the lattice quantization noise and its related 



to Leu in (dl]). Then ^} is proved. 



□ 




(30) 
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properties [17] will play an important role in solving this problem. The technical details can be found in 
Appendix El □ 

V. Discussions 

As a special case of Theorem [U we also propose a MIMO DPC for SI channels with perfect CSIT. The 
optimal linear-assignment matrix W is Wmms^H, where Wmm5£ is the LMMSE filter used to estimate 
in (pT)) with zero interference S'^^ = 0. When T ^ oo, it can easily be checked that with the selected 
W, Fr = Wmm5£ and i?LA becomes the interference-free rate 

^log(|HEGHT + ll^^|/|lw|). (31) 

If we treat Hs as SI, then Fs = Fr = Wmmse- There are other features that make this MIMO DPC not a 
straightforward extension of the scalar one [8] [18]. 

1) Our transmitter is subject to a covariance matrix constraint Eg, instead of the conventional power 
constraint in [18]. The transmitter filter Ft is added for this new constraint. The selection of Ft in 
Lemma [2] depends on the lattice quantizer chosen. It is more involved than the extension from scalar 
to MIMO DPC using Gaussian random codebooks in [3], where one can directly set the covariance 
matrix of the Gaussian random vector (which generates the Gaussian codebook) to Eg to meet this 
new constraint. Also due to this constraint, we select Wmmse according to an equivalent channel 
H defined right after (fTTI) instead of the straight-forward one H . 

2) With full CSIT, our key observation dTT] ) in the achievement proof is equal to the information- 
lossless property of the LMMSE estimation [20] in the interference-free channel. Compared with 
the simple algebra used in [18, pp. 2296] to compute the achievable rate, this property provides 
new insights to the achievement of the interference-free rate. 

3) We chose a different decoder (l22l) compared to [18]. This lattice decoder can benefit from practical 
lattice-decoding algorithms [23], which makes the simulations in Section |VI] possible. Our proof of 
Theorem [His tailored for this lattice decoder, and is completely different from the proof in [18]. 
In fact, in the MIMO case the equivalent noise e in (|26l) is colored. This makes direct extension of 
the proof in [18] to the MIMO case tedious and difficult. Our proof avoids this problem. 

Finally, we briefly sketch the methods to apply the proposed coding to MIMO GBC with full CSIT. 
For MIMO GBC with CDIT [4], [5], [11], these methods can also be applied easily. Consider a MIMO 
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GBC system with K users and M transmitter antennas. The sum of the coded signals of all users will 
be sent to all receivers. Without loss of generality, we focus on the coding scheme for a user j. The tih. 
received complex symbol y^' j for this user who has Nj receiver antennas, can be written as 

y;V=H;(E4J+<, (32) 

where , G C^^ \ 1 < A: < isT, is the t\h vector symbol of the message of user k, W- G C^>^^ is the MIMO 
channel gain matrix, and n*^, G C^^^^ is the additive Gaussian noise at the receiver where n^' ~ A^([;;(0,Ia^.). 
The optimal coding scheme for MIMO GBC [6] will first specify the MIMO DPC achievable rates i?fcS by 
determining the encoding order for all users and the covariance matrix constraints E^^s for x^^, and apply 
the MIMO DPC on each user's message [6], [7]. Whether or how a user's signal will be interfered by 
the other users' signals is governed by the MIMO DPC encoding order. In general, the signals encoded 
earlier will be invisible to the signals encoded later, while the former will be interfered by the latter. 

Assuming that user j is encoded after all the users with indices larger than j, it must cancel these 
interferences. To do this, we rewrite (|32|) as 

y5,=H5x5, + H5( f x^,) + « + H5(£x^,)). (33) 

k=j+\ k=\ 

By concatenating the real and imaginary parts of the complex vectors for all T symbols similarly to 
Section Ull (|33l) can be recast as an equivalent real channel fitting (fT9l) . where Eg corresponds Ey, the 
second term in (|33l) corresponds to the transmitter SI Hs, and the third term corresponds to the noise. 
Although this noise is not white and Gaussian as in (fT9l ), the former property can be resolved by the 
standard whitening filtering approach, while the latter is met since when T ^ oo, u approaches Gaussian 
[17], then all users' transmitted signals approach Gaussian according to the Proof of Lemma|2l From (|3TI) . 
the optimal specified rate Rj = log(|I/^^. + J^^<^H^Eyt(H^)^|/|lAr^ -|-L,t<jHjE^(Hp'''|) is achievable when 
r — > oo. Another requirement in [7] is that all users' signals are mutually independent. This requirement 
is met since for each user, the dither u also makes the transmitted signal x independent of the interference 
s according to Section HTBl The details of the above statements can be found in [24]. 

VI. Numerical examples 

In this section, numerical examples are presented to demonstrate the performances of the proposed 
filters with practical lattice coding schemes. To achieve the rate performance specified in the previous 
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theorems, good nested-lattice encoding/decoding algorithms tailored for a very long codeword length (i.e. 
r — i> oo) are needed. These results may be approached practically by combining the proposed filters with 
the complex code design methods proposed in [9], [10], [12], which are beyond the scope of this paper. 
In this section, we alternatively examine the error performance at high SNR with a reasonable codeword 
length (and decoding latency). A Fano sequential-decoding based lattice decoder [25] is used to solve 
(|22l) with a good performance. 

For simplicity, we consider complex scalar slow fading channels with only CDIT as examples. Using 
the methods described in Section |Vl the channel can be recast as a real MIMO 2 by 2 channel. The 
optimal linear- assignment matrix W in Section UlI-AI is 

1-2-^ 
1-2-^ 

where R is the code rate, and the LA-GPC can achieve the interference- free outage performance [4]. The 
optimal W for general MIMO slow fading channels is unknown and finding it is very hard and beyond 
the scope of this paper. The results in [4] were reached using a Gaussian random codebook ensemble with 
r — i> oo. As shown by the simulation results in Fig. |2l with the proposed filters, the interference-free error 
performance can almost be achieved at high SNR using finite length random lattice codes and decoders 
in [25]. The fading coefficients are generated as i.i.d. circularly symmetric complex Gaussian random 
variables with variance equal to 1. As for the lattice ensemble, as in [16], we use the pair of self-similar 
nested lattices drawn from the ensemble of Construction-A lattices defined in [26] . The parameters of the 
linear code [26] in this lattice are (n = 2T,p,k) = (12,47,6). The lattice codeword length is T = 6. A 
large Gaussian distributed interference signal is added to make SNR 10 dB much larger than the signal-to- 
interference and noise ratio (SINR). Two different rates, 2 and 4 bits per channel use, are simulated, and 
the block error rates are obtained by averaging over at least 10000 channel realizations at high SNR. The 
small gaps between the error curves of random lattice codes and the interference-free outage probabilities 
in Fig. |2] demonstrate that the decoder decodes as if the interference is almost completely cancelled. For 
comparison, we also present the "interference as noise" cases. In these cases, the nested-lattice encoder 
completely ignores the Gaussian SI s, and the decoder treats the interference plus noise s-fn as an 
equivalent Gaussian noise to decode the lattice codewords. In [11], applying LA-GPC was shown to have 
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a significant gain over applying DPC naively, where the latter means that the transmitter assumes the 
channel is fixed at its expected value E[H\. Since the channel in our simulations is zero mean, the "naive 
DPC" curve in [11] corresponds to the "interference as noise" curves in our simulations, which also suffer 
severely due to lack of perfect CSIT. 

VII. Conclusions 

In this paper, we focused on structured codebook designs for fading MIMO side-information channels 
where interference is known at the transmitter. We showed that the rate performance of the MIMO LA- 
GPC using random Gaussian codebooks can be achieved by carefully designed spatial filters combined 
with nested-lattice coding. With only CDIT, the proposed coding scheme has good, sometimes optimal, 
rate performance. When full CSIT is available, the proposed coding scheme can achieve the optimal 
interference-free rate. Our coding can be applied to MIMO GBC with CDIT or perfect CSIT to obtain 
the promised rate performances. 

Appendix 

A. Proof of the equivalent channel in Lemma \3\ 

According to the definition of Hp, we can rewrite the channel (fT9l ) using x in (l24l) as y = Hpx + Hs + z. 
Note that the random coding channel (fTTI) has a one-to-one correspondence to this channel. Then 

Fry + u = (x + FrHs + u) + (FrHr - I„,r)x + FrZ (34) 

Cq -f Cc -f (FrH - Fs)s + (FrHp - I^^)^ + 

{b) 

— Cq -(- Cc 4" 6, 

where equality (a) is due to (l25l) . and (b) is due to the fact that x distributed as u as in the Proof of Lemma 
121 Let c^ = Cq + Cc, then c^ G A^ + Cc G Af due to the definition of nested lattice. And this concludes the 
proof. 

B. Proof of Theorem [7] 

Before introducing the proof, we first borrow the following useful definition from [17]: 
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Definition 2: The shaping of an m/^-dimensional lattice quantizer (with lattice point ?i G A) by an 
mi X nil nonsingular matrix F is the quantizer Qq,^, for which 

ea,(g)=F-e^(F-ig). 

The shaped quantizer Qq,^ is also a lattice quantizer with lattice points "ks = FX, Xe A and the fundamental 
region = {g : F^^g G i^}, where 'V is the Voronoi region of A. 

We first consider the achievable rate for any finite T, then let T 0° to complete the proof. We use an 
ensemble of dimension mT "good" nested lattices {Aq C Ac} defined in [16]. The autocorrelation matrix of 
the quantization lattice's dithered quantization noise is E^. The fundamental volume | li^d I of the channel 
coding lattice Ac is fixed (constant with T). As in [16], we use the ambiguity lattice decoder with decision 
region — {g G K-'"^ : |Lgp < ^(1 +a), a > 0} to simplify the proof. The error probability of the 
ambiguity lattice decoder will upper-bound that of the generalized minimum Euclidean distance lattice 
decoder (|22l) . By taking expectation over the ensemble of random channel coding lattices, the average 
error probability of this decoder is upper-bounded by [26] 

EA,[P.(©r,a|Ac)]<P(e^2)r,a) + (l + P)^^, P > 0. (35) 

1 1 "^c I I 

Let us now focus on the first term in the upper-bound (l35l) . We rewrite it as 

P(e ^ 2)r,a) = P(|Le|2 > + a)), (36) 

where the distribution of e is shown in Lemma [3l Since u in (l26l) is not exactly Gaussian, we will construct 
a "noisier" Gaussian noise eg compared to the non-Gaussian Le to upper bound (l36l) as in [16]. However, 
our construction is more involved than that in [16]. First, due to the additional transmitter filter Ft in 
Lemma [21 in our case the noise eg must be constructed with the aids of the shaped lattice quantizers. 
Second, unlike [16], our eg is colored and another noise term eg is defined to use the Chemoff bound in 
[16] to complete the proof. The details come as follows. First note that u in (|26l) has covariance matrix 
E^, we rewrite it as 

(y2E;)u^, (37) 
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where u'^ = --^(Ei^)^^u. The new quantization noise u"^' is white with autocorrelation matrix jlmT, and 
from [17] we know that u^" is uniformly distributed in the region 

^^^{g:V2Llge'U,}. (38) 

This region is the fundamental region if we shape the lattice quantizer associated with by -^(L*^)^^, 
as described in Definition [21 Using [18, Lemma 11] and following in the footsteps of [16], for all g e R*"^ 
the probability density function (pdf) Pu" (g) of u^*^' satisfies 

Puw(g) < {K/Kr^^w{o{mT))pn,{g), (39) 

where the covering radius is the radius of the smallest sphere centered at the origin that contains Q.^, 
and is the radius of a sphere having the same volume as Q.^. The function (g) is the pdf of a white 
Gaussian random vector ni ~ A^]R(0,a^Im7') with 

o2 = {K)^/mT. (40) 

Now we construct the "noisier" Gaussian error vector corresponding to Le as 

eg^L-{(FrHF-W)(v^S;)ni + (FrH-Fs)(s + n2)+Fr(z + n3)}, 

where 112 ~ A/^k,(0, (o^ — l/2)E.s), ns ~ A/^k,(0, (o^ — l/2)lAr7'). z, ni, 112 and ns are independent. Since u*^ 
has covariance matrix ^ImT, jtnT = E|u^p < (RJ^,)^ from [17]. From (|40|) . 1/2 < o^, thus 112 and 113 are 
well-defined. By using (|37] ) in (|26l ) and according to (|39l ). we indeed replace u in (|26l) with -\/2E|^ni, and 
add additional noise vectors 112 and n3 to make "noisier" than Le. Then we can upper-bound (l36l) as 

P(e ^ ©r.a) < (RyRt)"'^exp(o(mr))P(|e,|2 > ^(1 +a)). (41) 

To further upper-bound (|4T]) . first note that eg is a colored Gaussian vector with covariance matrix 

L{2o^Le)17 = 2o^L^ . (42) 

The L.H.S. of (|42l) results from the facts that the distributions of ni and z + n^ are both A^]r,(0, o^Imj), and 
the distribution of s + n2 is NB.iO.o^'Ls). And the equality (|42|) is valid due to the selection L = (E|)"^ 
Since e^ is colored, the Chernoff bound used in [16] can not be directly applied. To resolve this issue, 
we define a white Gaussian vector eg = (E^)^^eg. Since E^ is symmetric, from the Rayleigh-Ritz theorem 
[21], we know that 

^min(^^ )^g^g — ^g^v ®g ~ l^gl ' (43) 
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where ?iniin(^^^^) is the minimum eigenvalue of E^^ Since is positive definite, l/Xniin(E~^) = 
^max(^v) > 0- We further denote ?imax(^^') as Xmax to simplify the notation. From (|43] ). we know 

P(|e,p > ^(1 +a)) < P f |e,|2 > :^il+a)) , (44) 

Since if |eg|2>mr(l+a)/2then \eg\^ > mT{\ +a)/2'k 

max- From (|42l) and the definition of e„, we know 

that eg ~ A^]r(0, 2a^I„jr) and the Chernoff bound in [16] can be applied to bound (f44l) . 

Finally, using (|4T)) . (|44)) and following the bounding technique in [16], for arbitrary a > 0, £2 > and 

sufficiently large T we have 

^/ ^ ^ f f , K (a"-l-loga") o(mT) 
P(e ^ ©r,a) < exp i^-mT log ^ + ^ ^ ^ ^ - 

where a" = (1 +a)/(4a^?imax)- The last inequality comes from the following facts. First, since the 
quantization lattice {A^} is "good" (defined in [16]) with second moment 1/2, E^ \^mT when T — > 0° 
[17]. From (1381) . Q-w i^q and R'^/R^, approaches the covering efficiency [16] of {A^}. Then logRJ;|,/Rf^, 
since {A^} is "good" [16]. Note that E^ \^mT then Xmax 1/2. Following [16] we know that 
1/2 and a" - 1 - log a" > 0, for some arbitrary a > 0. Thus (|45l) holds. 
As for the second term in the upper-bound (l35l) . it can be proved that if the lattice code rate meets 
(|30l ). then this term can be upper-bounded by £i/2 for an arbitrarily small £1 > and sufficiently large 
T . The proof is similar to [16] and the details can be found in [24]. 
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Fig. 1 

MIMO SIDE-INFORMATION CHANNEL WITH (A) CDIT (B) PERFECT CSIT. 
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Fig. 2 

With the proposed filters, random lattice codes achieve almost interference-free error performance for 
complex scalar slow fading si channels with cdit at high snr. 



